feat: Add late interaction model training support for retrieval by rnyak · Pull Request #2283 · NVIDIA-NeMo/Automodel

rnyak · 2026-05-20T17:18:29Z

What does this PR do?

Updated train_bi_encoder.py to support local ColBERT-style (multi_vector) pooling by adding colbert_scores_and_labels(), which computes MaxSim scores with query and passage attention-mask handling. The train and validation paths now route ColBERT models through this scoring function instead of standard pooled embedding contrastive scoring.

The changes in this PR add support for multi_vector pooling and MaxSim scoring, both with and without distributed in-batch negative training.
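For readers unfamiliar with MaxSim, the following is a minimal sketch of the scoring idea described above, not the code in this PR. It uses plain Python lists instead of tensors so it runs without dependencies. The function names, the argument layout, and the convention that passage i is the positive for query i are all assumptions made for illustration; the actual colbert_scores_and_labels() may organize passages and labels differently.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def dot(u: Vector, v: Vector) -> float:
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def maxsim_score(
    query_tokens: Sequence[Vector],
    query_mask: Sequence[int],
    passage_tokens: Sequence[Vector],
    passage_mask: Sequence[int],
) -> float:
    """Sum, over unmasked query tokens, of the best match among unmasked passage tokens."""
    score = 0.0
    for q_vec, q_keep in zip(query_tokens, query_mask):
        if not q_keep:
            continue  # padded query token contributes nothing
        sims = [
            dot(q_vec, p_vec)
            for p_vec, p_keep in zip(passage_tokens, passage_mask)
            if p_keep  # padded passage tokens can never be the max
        ]
        if sims:
            score += max(sims)
    return score


def maxsim_scores_and_labels_sketch(
    queries, query_masks, passages, passage_masks
) -> Tuple[List[List[float]], List[int]]:
    """Hypothetical helper: score every query against every passage.

    Assumes passage i is the positive for query i, so labels are 0..N-1.
    """
    scores = [
        [maxsim_score(q, qm, p, pm) for p, pm in zip(passages, passage_masks)]
        for q, qm in zip(queries, query_masks)
    ]
    labels = list(range(len(queries)))
    return scores, labels
```

The resulting score matrix and labels can be fed to a standard cross-entropy contrastive loss, which is the role pooled-embedding similarity plays for single-vector models.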

The latest commits:

Added a detach_distributed_inbatch_negatives argument (default True). By default, distributed in-batch negatives keep the previous, more efficient detached behavior; setting it to False preserves gradients for remote passages.
Renamed ColBERT-style scoring to multi_vector/MaxSim, kept colbert as a backward-compatible alias, and made distributed MaxSim scoring more memory-efficient by scoring one passage slot at a time.
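As a rough illustration of what "scoring one passage slot at a time" could look like, here is a dependency-free sketch. It assumes that each batch item carries several passages (for example, one positive plus hard negatives) and that a "slot" is one such position across the batch; this layout, the function names, and the slot-major column order are assumptions, not taken from the PR. Attention masks and gradient detaching are left out to keep the sketch short.

```python
from typing import List

Matrix = List[List[float]]  # one row per token embedding


def maxsim(query: Matrix, passage: Matrix) -> float:
    """Sum over query tokens of the best dot product with any passage token."""
    return sum(
        max(sum(a * b for a, b in zip(q, p)) for p in passage)
        for q in query
    )


def score_by_slot(
    queries: List[Matrix], passages_by_slot: List[List[Matrix]]
) -> List[List[float]]:
    """Hypothetical helper: passages_by_slot[s][b] is the slot-s passage of batch item b.

    Scores are built slot by slot, so only one slot's token-level
    similarities need to exist at any time. Column index is s * B + b.
    """
    scores: List[List[float]] = [[] for _ in queries]
    for slot_passages in passages_by_slot:  # process one slot at a time
        for row, query in zip(scores, queries):
            row.extend(maxsim(query, p) for p in slot_passages)
    return scores
```

With tensors, the same loop would trade one large query-by-all-passages similarity tensor for several smaller per-slot ones, which is the kind of peak-memory saving the commit message describes.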

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?

copy-pr-bot · 2026-05-20T17:18:33Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

rnyak · 2026-05-20T23:56:19Z

/claude review

Signed-off-by: Ronay Ak <ronaya@nvidia.com>

add late interaction for retrieval

4d5e76e

rnyak requested review from a team, HuiyingLi, adil-a, akoumpa and hemildesai as code owners May 20, 2026 17:18

rnyak self-assigned this May 20, 2026

rnyak added the enhancement New feature or request label May 20, 2026

rnyak marked this pull request as draft May 20, 2026 17:18

rnyak temporarily deployed to public May 20, 2026 17:40 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 20, 2026 17:46 — with GitHub Actions Inactive

remove redundant query masking for colbert

4b31317

Signed-off-by: Ronay Ak <ronaya@nvidia.com>

rnyak temporarily deployed to public May 22, 2026 00:15 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 22, 2026 00:16 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 22, 2026 00:17 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 22, 2026 00:23 — with GitHub Actions Inactive

support colbert pooling with inbatch neg

70b9223

Signed-off-by: Ronay Ak <ronaya@nvidia.com>

rnyak temporarily deployed to public May 27, 2026 13:58 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 27, 2026 14:00 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 27, 2026 14:05 — with GitHub Actions Inactive

oliverholworthy reviewed May 28, 2026

Comment thread nemo_automodel/recipes/retrieval/train_bi_encoder.py Outdated

oliverholworthy reviewed May 28, 2026

Comment thread nemo_automodel/recipes/retrieval/train_bi_encoder.py Outdated

oliverholworthy reviewed May 28, 2026

Comment thread nemo_automodel/components/models/common/inbatch_neg_utils.py

oliverholworthy reviewed May 28, 2026

Comment thread nemo_automodel/recipes/retrieval/train_bi_encoder.py Outdated

preserve distributed MaxSim gradients for late-interaction in-batch n…

8c65f90

…egatives Signed-off-by: Ronay Ak <ronaya@nvidia.com>

rnyak temporarily deployed to public May 29, 2026 14:47 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 29, 2026 14:50 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 29, 2026 14:56 — with GitHub Actions Inactive

make distributed in-batch negative grads configurable

8d2d10c

rnyak temporarily deployed to public May 29, 2026 20:14 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 29, 2026 20:16 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 29, 2026 20:23 — with GitHub Actions Inactive

rnyak wants to merge 5 commits into main from rny/late_interaction_retrieval
